Representing Systems with Hidden State

نویسندگان

  • Christopher Hundt
  • Prakash Panangaden
  • Joelle Pineau
  • Doina Precup
چکیده

Introduction Learning and planning under uncertainty is a crucial focus of modern AI research. In this paper, we address the issue of sequential decision making. Much of the work in this field is based on the framework of Partially Observable Markov Decision Processes (POMDPs) (Kaelbling et al., 1998). In this framework, problems are modeled using discrete states and actions. Actions cause stochastic transitions between states. At each time step, a stochastic observation is also generated, based on the current state and the previous action. Much work has been devoted to planning in POMDPs when a model of the system (in terms of the stochastic transitions between states and the probability distributions over observations) is known. Unfortunately, learning POMDPs from data is a very difficult problem. One standard algorithmic solution is expectation maximization (EM) (Chrisman, 1992), but for POMDPs this approach is plagued by local minima (more so than for other probabilistic models) and works poorly in practice unless a good initial model of the system is used (Shatkay & Kaelbling, 1997). History-based methods (McCallum, 1995) often work better in practice but are less general. A lot of recent research has been devoted to finding alternative representations for such systems, e.g., diversity-based representation (Rivest & Schapire, 1994), predictive state representations (PSRs) (Littman et al., 2002) and TD-networks (Sutton & Tanner, 2005). These approaches aim to combine the generality of POMDPs with the ease of learning of history-based methods. The key idea underlying all of these approaches is that the state of the system is not considered as predefined; instead, it is viewed as a sufficient statistic for making (probabilistic) predictions about future trajectories. However, the

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...

متن کامل

On the Utility of Decision-Theoretic Hidden Subdialog

A spoken dialog system typically characterizes a domain task with multiple states interconnected by actions or thresholds as transitions between states. As the system attempts to solicit a piece of information from the user, it may have to engage in a hidden subdialog, or error handling within a particular state, before transitioning to a new state. Hidden subdialogs generally center on illocut...

متن کامل

The Duality of State and Observation in Probabilistic Transition Systems

In this paper we consider the problem of representing and reasoning about systems, especially probabilistic systems, with hidden state. We consider transition systems where the state is not completely visible to an outside observer. Instead, there are observables that partly identify the state. We show that one can interchange the notions of state and observation and obtain what we call a dual ...

متن کامل

Evaluating Mental State of Final Year Students Based on POMS Questionnaire and HRV Signal

Final year students are normally encountering high pressing in their study. In view of this fact, this research focuses on determining mental states condition of college student in final year based on the psycho-physiological information. The experiments were conducted in two times, i.e., priorand postgraduation seminar examination. The early results indicated that the student profile of mood s...

متن کامل

Runtime Verification of Stochastic, Faulty Systems

We desire a capability for the lifelong verification of complex embedded systems that degrade over time, such as a semi-autonomous car. The field of runtime verification has developed many tools for monitoring the safety of software systems in real time. However, these tools do not allow for uncertainty in the system’s state or failure, both of which are essential for monitoring hardware as it ...

متن کامل

The Role of Hidden Curriculum on Educational Function of Students(A Case Study in Neyshabour High-school Students)

This study studies the role of hidden curriculum on educational function of students in 2011-2012. It also studies the hidden curriculum (social setting) on educational function among high- school students in Nayshabour. This research is descriptive and inferential. It also uses Halpean craft questionnaire and the validity shows 88%. 290 students were selected. Social settings of high schools i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006